COMPREHENSIVE STUDY OF DEEP LEARNING BASED TELUGU OCR

نویسندگان

چکیده

The aim of the project is to understand offline One most popular and difficult pattern recognition subjects use optical character (OCR) read handwritten Telugu letters. This study suggests a three-stage OCR solution for documents that includes pre-processing, feature extraction, classification. For extraction boundary edge pixel points during preprocessing, we used median filtering on input characters as well normalisation skeletonization techniques. Each initially divided into three 3x3 grids stage, associated centroid each nine zones assessed. allows us recognise in various styles. Following that, drew projection angel's horizontal vertical symmetry character's closest pixel.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Telugu OCR Framework using Deep Learning

In this paper, we address the task of Optical Character Recognition(OCR) for the Telugu script. We present an end-to-end framework that segments the text image, classifies the characters and extracts lines using a language model. The segmentation is based on mathematical morphology. The classification module, which is the most challenging task of the three, is a deep convolutional neural networ...

متن کامل

A Survey of Telugu Ocr System

Optical character recognition is usually abbreviated as OCR. The object of OCR is automatic reading of optically sensed document text materials to translate human-readable characters into machine-readable codes. Today, reasonably efficient and inexpensive OCR packages are commercially available to recognize printed texts in widely used languages such as English, Chinese, and Japanese. These sys...

متن کامل

OCR for Telugu Script Using Back-Propagation Based Classifier

This paper deals with the theory and implementation of an Optical Character Recognition (OCR) system for printed Telugu script, which exploits the inherent characteristics of Telugu scripts, one of the major scheduled language of India, spoken by more than 66 million people, especially in South India. The principle idea is to convert images of text documents such as those obtained from scanning...

متن کامل

OCR of Printed Telugu Text with High Recognition Accuracies

Telugu is one of the oldest and popular languages of India spoken by more than 66 million people especially in South India. Development of Optical Character Recognition systems for Telugu text is an area of current research. OCR of Indian scripts is much more complicated than the OCR of Roman script because of the use of huge number of combinations of characters and modifiers. Basic Symbols are...

متن کامل

Candidate Search and Elimination Approach for Telugu OCR

In this paper we propose an OCR system for Telugu based on the candidate search and elimination technique. The initial candidates for recognition are found by applying a zoning method on input glyphs. We propose cavities as a structural approach suited specifically for Telugu script, where cavity vectors are used to prune the candidates found by zoning. A final template matching stage using con...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: International journal of engineering technology and management sciences

سال: 2023

ISSN: ['2581-4621']

DOI: https://doi.org/10.46647/ijetms.2023.v07i03.133